Disordered Sound Repetition Recognition in Continuous Speech Using Cwt and Kohonen Network
نویسندگان
چکیده
Automatic disorders recognition in speech can be very helpful for therapist while monitoring therapy progress of patients with disordered speech. This article is focused on sound repetitions. The signal is analyzed using Continuous Wavelet Transform with 16 bark scales, the result is divided into vectors and passed into Kohonen network. Finally, the Kohonen winning neuron result is put on the 3-layer perceptron. The recognition ratio was increased by about 20% by adding a modification into the Kohonen network training process as well as into CWT computation algorithm. All the analysis was performed and the results were obtained using the authors’ program “WaveBlaster”. The problem presented in this article is a part of our research work aimed at creating an automatic disordered speech recognition system.
منابع مشابه
Automatic disordered sound repetition recognition in continuous speech using CWT and kohonen network
Automatic disorders recognition in speech can be very helpful for a therapist while monitoring therapy progress of patients with disordered speech. This article is focused on sound repetitions. The signal is analyzed using Continuous Wavelet Transform with 16 bark scales. Using the silence finding algorithm, only speech fragments are automatically found and cut. Each cut fragment is converted i...
متن کاملProlongation Recognition in Disordered Speech using CWT and Kohonen Network
Automatic disorder recognition in speech can be very helpful for the therapist while monitoring therapy progress of the patients with disordered speech. In this article we focus on prolongations. We analyze the signal using Continuous Wavelet Transform with 18 bark scales, we divide the result into vectors (using windowing) and then we pass such vectors into Kohonen network. Quite large search ...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملComparison of Time-Frequency Representations for Environmental Sound Classification using Convolutional Neural Networks
Recent successful applications of convolutional neural networks (CNNs) to audio classification and speech recognition have motivated the search for better input representations for more efficient training. Visual displays of an audio signal, through various time-frequency representations such as spectrograms offer a rich representation of the temporal and spectral structure of the original sign...
متن کاملSingle Word Quick Repetition Test for Speech Sound Production in Farsi Speaking Children: Development, Reliability and Validity
Objective: The aims of the present study was to develop and validate a quick repetition test using familiar, culture-based words for the assessment of the Farsi speaking children suffering from Speech Sound Disorders (SSD). Method: The study population was divided into two groups, including 419 participants without any clinical signs of speech impairment and 36 cases with SSD. Psychometric pro...
متن کامل